Properties of the Box-Cox transformation for pattern classification
نویسندگان
چکیده
The Box–Cox transformation [1,2] (Box and Cox, 1964; Sakia, 1992) has been regarded as a parametric pre-processing technique aimed at making the distribution of a set of points approximately Gaussian. Since normality represents an assumption underlying many statistical data analysis tools, such technique has been widely applied in different fields of Computer Science. In this paper we will provide evidence that this technique can be useful also in the case of Pattern Classification, where Gaussianity of datasets is not so critical. By letting the Box–Cox transform work in operational ranges which do not necessarily correspond to an increase in Gaussianity, we will show that class separability can be improved: this is likely due to the non linear nature of the Box–Cox transformation, which deforms the space in a nonuniform way. We will also provide some suggestions on criteria that can be used to automatically estimate the best parameter of the Box–Cox transformation in the Pattern Classification context. & 2016 Elsevier B.V. All rights reserved.
منابع مشابه
A Simple Transformation Method in Skewness Reduction
Statistical analysis of non-normal data is usually more complicated than that for normaldistribution. In this paper, a simple root/power transformation technique developed by Niaki, et al [1]is extended to transform right and left skewed distributions to nearly normal. The value of theroot/power is explored such that the skewness of the transformed data becomes almost zero with anacceptable err...
متن کاملSmall Area Estimation of the Mean of Household\'s Income in Selected Provinces of Iran with Hierarchical Bayes Approach
Extended Abstract. Small area estimation has received a lot of attention in recent years due to necessity demand for reliable small area statistics. Direct estimator may not provide adequate precision, because sample size in small areas is seldom large enough. Hence, by employing models that can use auxiliary information and area effects in descriptions, one can increase the precision of direct...
متن کاملدو روش تبدیل ویژگی مبتنی بر الگوریتم های ژنتیک برای کاهش خطای دسته بندی ماشین بردار پشتیبان
Discriminative methods are used for increasing pattern recognition and classification accuracy. These methods can be used as discriminant transformations applied to features or they can be used as discriminative learning algorithms for the classifiers. Usually, discriminative transformations criteria are different from the criteria of discriminant classifiers training or their error. In this ...
متن کاملناهمگنی اجزای واریانس پروتئین شیر در سطوح مختلف تولید گله- سال و تاثیر آن بر پارامترهای ژنتیکی و ارزش اصلاحی برآورد شده گاوهای هلشتاین ایران
This study was carried out to investigate different data transformation methods on homogeneity and heterogeneity of variance components. Data included 305-day lactation records for protein yield from the first three lactations of Iranian Holstein cows collected from 1983 to 2014 by the Animal Breeding Center and Promotion of Animal Products of Iran. Data included 141670 records for 1st lactatio...
متن کاملClassification and properties of acyclic discrete phase-type distributions based on geometric and shifted geometric distributions
Acyclic phase-type distributions form a versatile model, serving as approximations to many probability distributions in various circumstances. They exhibit special properties and characteristics that usually make their applications attractive. Compared to acyclic continuous phase-type (ACPH) distributions, acyclic discrete phase-type (ADPH) distributions and their subclasses (ADPH family) have ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Neurocomputing
دوره 218 شماره
صفحات -
تاریخ انتشار 2016